Recognizing Small- Circuit Structure in Two-Qubit Operators 
and Timing Hamiltonians to Compute Controlled-Not GatesE 
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This work proposes numerical tests which determine whether a two-qubit operator has an atypically simple 
quantum circuit. Specifically, we describe formulae, written in terms of matrix coefficients, characterizing 
operators implementable with exactly zero, one, or two controlled-not (CNOT) gates and all other gates being 
one-qubit. We give an algorithm for synthesizing two-qubit circuits with optimal number of CNOT gates, and 
illustrate it on operators appearing in quantum algorithms by Deutsch-Josza, Shor and Grover. In another 
application, our explicit numerical tests allow timing a given Hamiltonian to compute a CNOT modulo one- 
qubit gates, when this is possible. 
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I. INTRODUCTION 

Quantum circuits compactly represent unitary operators 
and find applications in quantum computing, communication 
and cryptography (lj]. Such a representation can often be 
interpreted as a program (e.g., a sequence of RF pulses for 
NMR) whose execution on a quantum system of choice per- 
forms a requested unitary evolution. Simple steps in the pro- 
gram correspond to gates in the circuit, and smaller circuits 
lead to faster programs. In this work we discuss exact imple- 
mentations of two-qubit operators because (i) such operators 
suffice to implement arbitrary operators |2], and (ii) a number 
of controllable two-qubit systems were recently reported. 

The simulation of generic two-qubit operators via CNOT 
gates and one-qubit operators has been thoroughly investi- 
gated, resulting in several three-CNOT decompositions jj, 

. It is known that the swap gate requires three CNOTs 
and also that an arbitrary n-qubit operator requires at least 
\\{A n — 3« — 1)1- The proof of this latter result 0] holds for 
any controlled-M gate, where u is a given fixed one-qubit oper- 
ator. For n = 2, it has been shown that an arbitrary controlled- 
u gate is generically worse than the CNOT 

The above-mentioned results motivate the focus on the 
basic-gate library 01, which consists of the CNOT gate and 
all one-qubit gates: it is powerful and well-understood. Yet, 
given the diversity of implementation technologies, it is not 
clear that the CNOT gate will be directly available in a given 
implementation. Nonetheless, we believe results expressed in 
the basic-gate library will be relevant. An analogous situation 
occurs in the design of (classical) integrated circuits. In this 
context, first technology-independent synthesis is performed 
in terms of abstract gates (AND, OR, NOT). Later, during 
technology mapping, circuits are converted to use gates that 
are specific to a given implementation technology (e.g., NOR, 
NAND and AOI gates, which require very few CMOS tran- 



sistors). Work in the direction of quantum technology map- 
ping includes techniques for expressing a CNOT gate in terms 
of a given entangling two-qubit gate and arbitrary one-qubit 
gates 1 8]. The simulation of CNOT gates with implementation- 
specific resources is the basis of a major physical implemen- 
tation technology |9]. 

The analogy with classical logic synthesis provides the fol- 
lowing additional intuition: operators useful in practice will 
not be the worst-case operators studied in the aforementioned 
works. This belief is confirmed by published quantum algo- 
rithms and communication protocols. It is therefore important 
for quantum logic synthesis techniques to detect when a given 
operator can be implemented using fewer gates than are nec- 
essary in the worst case. For some classes of operators, this 
is easy; e.g., the algorithm in 1 10] implements tensor-product 
operators without CNOTs. The matrix of a controlled-f/ oper- 
ator can be recognized by its pattern of zeros and ones (either 
directly, or after pre- and post-multiplication by wire swaps). 
Song and Klappenecker 1 1 1 ] study optimal implementations 
of two-qubit controlled-unitary operators, known to require 
up to two CNOT gates. They contribute a catalog of numerical 
tests that detect when zero, one or two CNOT gates are re- 
quired, and similar criteria for the number of basic one-qubit 
gates. 

We address a related question for arbitrary two-qubit oper- 
ators and contribute simple numerical tests to determine the 
minimal achievable number of CNOTs, including a novel one- 
CNOT test. We also generalize a two-CNOT test from 0] and 
make it easier to compute. Such explicit numerical tests fa- 
cilitate a new application. A given two-qubit Hamiltonian H, 
if timed precisely, may allow one to implement a CNOT using 
s' Hr and one-qubit gates. We show how to compute correct 
durations. 
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It is well known that an arbitrary one-qubit gate u can be 
written as u = ^R z (Q)Ry(^)R z (\]f) |1]. Furthermore, the 
Bloch sphere isomorphism suggests that the choice of y, z is 
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arbitrary in the sense that any pair of orthogonal vectors will 
do: in particular, we may write 

u = ^R z (e)R x (^)R z {\\f) = e®R x (a)R z (8)R x {$) 

These decompositions are more convenient when working 
with CNOT gates because R z gates commute through the con- 
trol of the CNOT whereas R x gates commute through the tar- 
get. We will denote by C k , a CNOT with control on the j-th 
wire and target on the £-th. For convenience, we consider the 
CNOT gate to be normalized to have determinant 1 . 

Additional conventions are as follows. For g any complex 
matrix, g' denotes the transpose and g* denotes the adjoint, 
i.e. the complex-conjugate transpose. Additionally, %(g) — 
p(x) = det (xl — g) denotes the characteristic polynomial of 
g. We use axis-dependent phase operators =/?*(tc/2), 
* = x,y,z- Finally, St/ (4) denotes the group of all determinant 
one unitary matrices, fixing the global phase of a two-qubit 
unitary operator up to ± 1 , ±z". 

We now consider when two-qubit operators u,v differ by 
pre- or post-composing with one-qubit operators and possibly 
by an irrelevant global phase. In this case, we write u = v 
and say that u and v are equivalent up to one-qubit gates. The 
following invariant characterizes when this occurs. 

Proposition II.l Let y:U(4)^U (4) be given by the formula 
u i — * u(a y f j2 u'(a y ) 02 . Then for u,v e 51/(4), u = v 
3C[y(")]=X[±Y(v)]- 

We defer the proof to the Appendix. However, note that this 
proof provides an explicit procedure for computing the one- 
qubit operators a,b,c,d 6 SU(2) such that (a (g) b)u(c <g> d) = 
e li, v in the event that %[y(m)] = 3C[±Y( V )]- We discuss Y more 
fully in the context of minimal universal two-qubit circuits 
Q|. Related invariants are discussed in lll2l[l3ll . and general- 
izations in fiUl . 



Proof: u is simulated using one CNOT gate iff u = C\ or 
u = C\. Now j(Cl) = -io z ® cy*; also j(C 2 ) = -ic x ® a z . 
Each has % given by (x + i) 2 (x — i) 2 . □ 

In particular, we see that C\ = C\. This can also be seen 
from the well-known identity (H H)C\(H ®H) = C\. We 
will use this fact for the final case, m — 2. 

Proposition III.3 An operator u € SU (4) can be simulated 
using two CNOT gates and arbitrary one-qubit gates from 
St/ (2) iff x[y( m )] nas a U rea l coefficients, which occurs iff 
tr\y{u)\ is real. 

Proof: Since C\ = C\, it is clear that u can be simulated us- 
ing two CNOT gates iff u = C\(a(i) b)C\. We decompose a = 
R x (a)R z (S)R x ($) decomposition and b = R z (B)R x (ty)R z (y), 
and pass R x gates and R z gates outward through the tar- 
get and control of the CNOT gates. Thus we are left 
with u = C 2 [R z (b) <£> R X (§)]C 2 . Explicit computation yields 

X\y(C 2 [Rz($)®Rxtt)]C 2 )] - (x + + e-<W)(x + 

gKMJj^+g-iCM)), On the other hand, if x[y(«)] has all real 
coefficients, then the eigenvalues come in conjugate pairs; it 
follows from this and Pronosition llLll that x[y( m )] i s as above 
for some 8,(|). 

Finally, we note that for u S SU (N), and %(u) — T[(x - A,,), 

we have Uh = !■ Thus %(u) = ^n^) 11^ - = I\(hx - 

1). It follows that the coefficient of x k is the complex conju- 
gate of the coefficient of x N ^ k . In particular, for N — 4, the 
coefficient of x 2 is real and the coefficients of x 3 ,x are tr(w) 
and its conjugate. Since the constant term and the x 4 coeffi- 
cient are 1, we see %(u) has all real coefficients iff tr(w) is real. 

□ 



IV. SYNTHESIS ALGORITHM AND ITS VALIDATION 



HI. OPTIMIZING CNOT-COUNT 

We now characterize which two-qubit operators admit a 
quantum circuit using only m CNOT gates. Since any two- 
qubit operator is implemented by some three CNOT circuit, 
the relevant cases are m = 0, 1,2. We begin with case m = 0. 

Proposition III.l An operator u £ SU (4) can be simulated 
using no CNOT gates and arbitrary one-qubit gates from 
SU{2)iff%[y{u)] = {x+\) A or{x-l)\ 

Proof: u can be simulated using no CNOT gates iff u = I. 
Thusx[Y(«)] =Z[±Y(/)] =X[±I] = (*±1) 4 - □ 
The case m = 1 is similar. Note that this test requires 
normalizing the global phase so that det(v) = 1, implicit in 
v e SU (4). Had we not normalized the CNOT gate, x[j(C 2 )] 
would not be of the form described. 

Proposition III.2 An operator u £ SU (4) can be simulated 
using one CNOT gate and arbitrary one-qubit gates from 
SU(2)iffx[j(u)} = (x + i) 2 (x-i) 2 . 



The results of Section ITTTl can be combined with the tech- 
niques of Propositions IIII.3I and III. 1 1 and the published litera- 
ture to yield an explicit circuit synthesis algorithm: 

• Given the matrix of a unitary operator u € 1/(4), divide 
it by v/ det(#) to ensure u £ SU (4). 

• Compute x[y( m )] to determine whether u requires zero, 
one, two, or three CNOT gates. 

• If u requires zero or one CNOT gates, use the techniques 
of the proof of Proposition IVI.ll to determine which 
one-qubit operators are required. 

• If u requires two CNOT gates, find the roots of %[y(u)] 
and determine the 8, (j) of Proposition MI. 3 1 Then use 
the methods of Proposition lVI. ll to determine what one- 
qubit gates are required at the ends of the circuit. 

• Finally, if u requires three CNOT gates, apply the meth- 
ods of the literature ]5[ . 
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By construction, the algorithm produces CNOT-optimal cir- 
cuits in all cases. It also outperforms those in O H. l5l [KM in 
important special cases, as shown below. 
Example IV.l Many quantum algorithms, notably Graver's 
quantum search 1 15] and Shor's number factoring | 16J, use 
the operator u = H ®H to create superpositions. Computing 
Y(m) allows our synthesis algorithm to recognize that u admits 
a quantum circuit containing no CNOTs. O 

This example is less trivial than it seems: while writing 
u = H (g)H makes it obvious that u requires no CNOT gates, a 
synthesis procedure will not receive an input of u = H®Hbut 
rather of the 4x4 matrix corresponding to u. It is not a priori 
clear that any worst-case CNOT-optimal circuit decomposition 
will implement u without CNOT gates. However, several pre- 
viously published algorithms do. For the next example, previ- 
ous two-qubit synthesis techniques produce circuits with more 
CNOTs than necessary. 

Example IV.2 The operator u that swaps 00) <-» |01) while 
fixing 1 10} and 1 11) plays a prominent role in the Deutsch- 
Josza algorithm lOO. Note that C\(I <8> CJjc) simulates u. 
Computing y(e' n / 4 u) reveals that u requires only one CNOT. 
However, depending on certain algorithmic choices, anywhere 
from one to four one-qubit gates could appear. In any event, 
this compares favorably to previous work which synthe- 
sizes a circuit with two CNOT and five one-qubit gates. O 

The algorithmic choices mentioned above come in two fla- 
vors. First, as the two CNOT gates C\ and C\ differ only by 
one-qubit gates, they are equivalent from the perspective of 
our methods. However, the number of one-qubit gates present 
in the resulting circuit depends on which of these is chosen. 
This is a finite problem: at most three CNOT gates appear and 
thus there are at most 8 possibilities, so we simply run through 
them all. Additional degrees of freedom arise in finding a cir- 
cuit that computes a given v using a given u and one-qubit 
operators, when this is possible. The proof proof of Proposi- 
tion l VI. 1 I describes an algorithm for this, and requires picking 
a basis of eigenvectors for a certain matrix. If the eigenval- 
ues are distinct, the only degree of freedom is the ordering 
of the basis of eigenvectors (4! = 24 possibilities). However, 
repeated eigenvalues allow more flexibility in choosing basis 
vectors, and potentially non-trivial circuit optimizations. 

Example IV.3 At the heart of Shor's factoring algorithm 
llT6ll is the Quantum Fourier Transform [ 1 ]. On two qubits, it 
is given by the following matrix. 




Explicit computation of %[y(T )] reveals that two CNOT gates 
do not suffice to simulate f . Thus, the following circuit to 
compute f is CNOT-optimal: 



-e 



Above, T- = e-'^/s an d S y = <r''<*V 4 . Note that this cir- 
cuit requires only three one-qubit gates, although two of these 
have been broken up for clarity. Finally, given that this circuit 
is CNOT-optimal, it is not difficult to check by hand that its 
basic-gate count cannot be improved. O 



V. TIMING A HAMILTONIAN TO COMPUTE CNOT 

Our numerical tests facilitate a new application. Given a 
Hamiltonian that can be timed to compute a CNOT modulo 
one-qubit gates, we illustrate finding the correct duration. Our 
example is a perturbation of o x ® O x by non-commutative one- 
qubit noise. 

H 42 = (0.42)7 ® cr + a* ® <x v 

Recall that a CNOT can be constructed using one-qubit gates 
and some time-iterate of the Hamiltonian a x ® cr*. However, 
to handle the noise term, existing techniques resort to Trotter- 
ization, which implements exp(A +B) by separately turning 
on A and B for short periods of time. Below we find a sim- 
pler, direct implementation of CNOT from f/42. It is especially 
interesting in light of concerns about the scalability of Trot- 
terization ifTsil . 

We compute y(e ,//42 ') for uniformly-spaced trial values of t 
and seek out those values at which the characteristic polyno- 
mial nears p(x) = (x 2 + l) 2 = x 4 + 2x 2 + 1. Our implemen- 
tation in C++ finds ?cnot = 0.80587 in twenty seconds on a 
common workstation. Hence, we produce a CNOT from H42 
and one-qubit gates without Trotterization. Specifically, since 
e !#42'cN0T implements C\ up to one-qubit operators, we use the 
technique of Proposition IVI.ll to compute the relevant one- 
qubit operators. We find that the matrices 



«2 



b 2 = 



l-i -1 
l + i 



iVi) -2 = 0.707107 , 



-0.21503 



0.976607/ 




-0.21503 





0.976607/ 



0.152049 + 0.690566/ 0.690566-0.152049/ 
-0.690566-0.152049/ 0.152049-0.690566/ 



satisfy C\ = (02 ® b2)e' H42 ' a " >T (02 ®d.2) with numerical preci- 
sion of 10~ 6 . 

Further numerical experiments suggest that building a 
CNOT is possible whenever 0.42 is replaced by a weight w, 
< w < 1. However, we have no analytical proof of this. Nu- 
merical experiments also suggest the impossibility of timing 
the Hamiltonian H X yz = G x ® G* + <7 >- ® O y + O z <g> O z so as 
to compute a CNOT. In other words, trying values of t in the 
range — 10 < f < 10 as above produced no candidate durations. 
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VI. CONCLUSIONS AND FUTURE WORK 

Our work addresses small-circuit structure in two-qubit uni- 
tary operators. In particular, we contribute tests for such struc- 
ture, and our techniques can be viewed as algorithms for find- 
ing small circuits when they exist. We detail such an algo- 
rithm that produces the minimal possible number of CNOT 
gates (zero, one, two or three) for each input. It is illustrated 
on circuit examples derived from well-known applications. 

The one-CNOT test has an additional use. It provides a nu- 
merical method for timing a given two-qubit Hamiltonian H 
so that e" H realizes a CNOT gate up to local unitary operators 
(one-qubit gates,) given this is possible for H. 

Acknowledgments. This work is supported by the DARPA 
QuIST program and an NSF grant. The views and conclu- 
sions contained herein are those of the authors and should not 
be interpreted as necessarily representing official policies or 
endorsements of employers and funding agencies. 

Appendix 

Proposition VI.l Lety: SU(4) — > SU(4) be given by the for- 
mula u^ u(c y )® 2 u' (o y )® 2 . Thenforu,veSU(4),u = v <^ 
3C[y(")] = X[±Y(v)]- 

Proof: By definition, u = v <^=> u = (a(gib)Xv(a' (gib') for 



some one-qubit operators a,b,a',b' and some scalar X. Re- 
quiring m, v G St/ (4) implies A, = ±l,±i. We show below that 
u = (a(gb)v(a' (gib') %[y(u)} = x[y( v )]; me proposition 

then follows from the fact that y(iu) = —y(u). 

We recall that there exist E G SU (4) such that 
E 50(4) E* = SU(2)® 2 = {a®b:a,b£ SU{2)}. Such ma- 
trices are characterized by the property that EE' = —o y ®G y . 
This and related issues have been exhaustively dealt with in 
several papers JH E3 . 

The property %[y(u)] = %[y(v)] is not changed by replacing 
y with E*yE. Using the fact &' <8> a y = EE' = (EE')*, we 
compute: E*y{u)E = E*uEE'u'E'*E*E = (E*uE)(E*uE)' 

By making the substitution u i— > EuE*; it suffices to prove: 
for h,v6 SU (4), there exists x,y G 5(9(4) such that xuy = v iff 
X[uu'] = %[vv'}. Here, 5(9(4) is the real matrices within 517(4). 

Note that for P symmetric unitary, P 1 = P, hence [P + 
P,P — P] = 0. It follows that the real and imaginary parts of P 
share an orthonormal basis of eigenvectors. As they are more- 
over real symmetric matrices, we know from the spectral theo- 
rem that their eigenvectors can be taken to be real. Thus there 
exists q G 5(9(4) such that quu'q* is diagonal. By re-ordering 
(and negating) the columns of q, we can re-order the diagonal 
elements of quu'q* as desired. Thus if %[uu'] = %[vv'], we can 
find q,r G 5(9(4) such that quu'q' = rvv' V by diagonalizing 
both; then (vV^)(vV^)' = /. Let s = v*r J qu G 5(9(4). We 
have q'rvs — u, as desired. □ 
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